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Connected structure in the 2dFGRS 
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ABSTRACT 

We describe and apply a simple prescription for denning connected structures in 
galaxy redshift surveys. The method is based upon two passes with a friends-of-friends 
groupfinder. The first pass uses a cylindrical linking volume to find galaxy groups and 
clusters, in order to suppress the line-of-sight smearing introduced by the large random 
velocities of galaxies within these deep potential wells. The second pass, performed 
with a spherical linking volume, identifies the connected components. This algorithm 
has been applied to the 2dFGRS, within which it picks out a total of 7,603 systems 
containing at least two galaxies and having a mean redshift less than 0.12. Connected 
systems with many members appear filamentary in nature, and the algorithm recovers 
two particularly large filaments within the 2dFGRS. For comparison, the algorithm is 
has also been applied to ACDM mock galaxy surveys. While the model population of 
such systems is broadly similar to that in the 2dFGRS, it does not generally contain 
such extremely large structures. 
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1 INTRODUCTION 

At very large scales, baryonic material is concentrated 
into an interco nnected sponge-like network know n as the 
Cosmic Web (|Bond. Kofman fc Pogosvanl 119961 1. succes- 
sive redshift surveys have traced out imposing overdensi- 
ties in the galaxy distribution. Notable e xamples are the 
CfA 'Great Wall' jGeller fc Huchral 1 19891 ) and the 'Sloan 
Great Wall' l|Gott et al.ll2005h . which was also noted, if not 
named, in the Two-degree F ield Galaxy Redshift S urvey 
(2dFGRS. IColless et alj|200ll ) by iBaugh et all i|2004l ) and 
lErdogdu et all (|2004l ). 

Most studies of large-scale structure in the Universe 
concentrate on m easuring the galaxy power spectrum (e.g. 
ICole et all [20051 ) or its Fourier transform, the two-point 
correlation function (e.g. IZehavi et al.|[2002l ). These quan- 
tities will provide a complete statistical description of the 
galaxy distribution provided that their number density 
fluctuations are Gaussian. The standard model of structure 
formation, ACDM, does assume that the initial inhomo- 
geneities in the density field, generated during inflation, 
are Gaussian. However, the subsequent growth of structure 
due to gravitational instability induces significant non- 
Gaussianities in the density field, and higher order moments 
of the density distribution become important. These phase 
correlations within the density field can be characterised 
either through higher order galaxy cor r elation functions 
jBaugh et ail 12004 ICroton et all [20041 ; iGaztanaga et all 
120051 ; iNichol et all 12006) or through the properties and 



distribution of filaments. Quantitative studies of filamen- 
tar y structures in red s hift s u rveys have been performed 
by iBhavsar fc Barrowl d 19831) ; Ib arrow. Bhavsar fc Sonodal 
(Il985l) ; lEinasto et all (|2003l); iPimbblet fc Drinkwatei 
<|2004h and lStoica. Martinez fc Saad (|2010l ). 

A variety of algorithms have been designed to 
describe the morphology of these large-scale struc- 
tures using different t echniques such as percolation 



(Bh avsar fc Barrowl [19831) . visual identification of regions 
between clusters llPimbblet. Drinkwater fc Hawkrigd I200I ; 



IColberg. Krughoff fc Connolly! |2005FT~ mi nimal spannin g 
trees (|Barrow. Bhavsar fc Sonodal Il985l; IColberd [20071) 
the density field's Hessian (lAragon-Calvo et al.l 20071 ; 
iBond. Strauss fc CerJ |2010|; IZhang et all 120091 ), gradient 
(Morse theory. iNovikov. ColombT fc Dore 2006) or linkage 
between the two (ISousbie et al.l l2008alfbh . the Hessian of 



the potential field (IHahn et al 
120a 



'9), 



|2000| ; lAragon-Calvo et all 120071), the Candy and Bisous 



20071; I Forero- Romero et al 



Delaunay tessellations ( Schaap fc van de Weygaert 



models 1 Stoica et al. 20051 ; Stoica. Martinez fc Saai 20ld) 
and watershed transforms (Sousbie. Colombi fc Pichonl 



120091 ; lAragon-Calvo et al.ll2Q10l ). Many of these algorithms 
also partition the whole of space into clusters, walls, fil- 
aments and voids. They are often applied to dark mat- 
ter simulations to help describe t he mass distribution, but 
with a few notable except i ons jBhavsar fc Barrow 19831; 



Barrow. Bhavsar fc Sonodal Il985l ; |pimbblet fc Drinkwater! 
2004 1; IBond. Strauss fc Cenl l20ld ; IStoica. Martinez fc Saan 
2010), they rarely include a comparison with observational 
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data. A primary motivation for this paper is to carry out a 
detailed quantitative comparison of filament properties us- 
ing the 2dFGRS and mock galaxy catalogues created us- 
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ing a ACDM simulation l|Angulo et al. 2008TI and a semi- 
analytical galaxy formation model i Baugh et alj |2005). 

An important aspect of the comparison between model 
and observed large-scale structure concerns the existence in 
both the 2dFGR S and the Sloan Digital Sky Survey (SDSS; 
lYork et al,ll200d ) of some extremely large structures. These 
objects are known to have a large impact on the higher order 
correlations of the g alaxy distribution llBaugh et al. 20041; 
Croton et ail 12004 iGaztafiaga et al l 120051 ; iNichol et"afl 



2006), and on its topology I Park et al. [2005'). How common 



such structures are within the ACDM model remains con- 
tentious (Yaryura, Baugh & Angulo 2010). 

In Section 2, we describe the detection algorithm that 
we have developed, and the observational and mock data 
that will be compared. The results of this comparison are 
described in Section 3 and conclusions drawn in Section 4. 



2 METHODS 

In this section, we will describe the observational data that 
will be analysed, the algorithm for finding connected sys- 
tems, the calculation of the luminosity of such objects, and 
the mock catalogues that will be used in order to compare 
the ACDM model with the observational data. 



2.1 Data 

The observational data used in this study are the two large 
contiguous patches towards the north and south galactic 
poles (NGP and SGP) in the final data release of the 2dF- 
GRS jColless et al.ll200ll ). In total, these regions contain 
191,897 galaxies with a median redshift of 0.11, covering 
an area of approximately 1500 deg 2 to a flux limit corre- 
sponding to 6j ~ 19.45. A bright flux limit of bj < 14 is also 
imposed to exclude objects whose total fluxes are difficult 
to determine from APM photographic plates. 

2.2 The detection algorithm 

A desirable property of an algorithm extracting large-scale 
structure is that there should be no preferred direction for 
the resultant systems. However, redshift space distortions 
make the line of sight a special direction in galaxy redshift 
surveys. The most striking consequence of non-Hubble flow 
veloci ties is to stret ch galaxy clusters, creating 'fingers of 
god' (Jackson 1972) in the redshift-space galaxy distribu- 
tion. These elongated redshift-space distortions need to be 
removed before searching for real structures. We achieve this 
by taking t he 2PIGG catalog ue of groups and clusters con- 
structed bv lEke et akl <|2004ah from the 2dFGRS. These were 
found using a friends-of-friends algorithm with a cylindri- 
cal linking volume pointing along the redshift direction as 
described in their paper. Having found galaxies belonging 
to groups and clusters in this way, we would like to col- 
lapse the 'fingers of god' by placing these galaxi es at their 
;roup c entre positions. One complication is that lEke et all 
2004a) note that they would expect the 2PIGG catalogue 
to contain a few tens of per cent of interloper galaxies that 
are incorrectly assigned to groups. To try and correct for 
this inevitable misassignment, we choose to retain a redshift- 
dependent fraction 
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Figure 1. Variation in the number of 2dFGRS galaxies in sys- 
tems extracted by the algorithm as the relative linking length, 
b, changes. Black lines denote galaxies from the NGP, red lines 
from the SGP. Solid lines represent the number of galaxies in 
the largest structure, whilst the dashed lines show the number of 
galaxies in all systems with at least two members. 



/(*) = 



2.4 + z 



(X) 



of the members assigned to each group. This expression for 
f(z) is derived from the cont amination o f group s found in 
mock 2dFGRS catalogues by lEke et all (|2004aT ). The ran- 
domly selected fraction f(z) are all replaced by a single 
point at the group centre, whereas the 1 — f(z) of 'interloper' 
galaxies are jettisoned from the list of group members and 
left at their observed redshift space positions. 

The first friends-of-friends pass suppresses the redshift 
space distortions associated with intragroup line-of-sight 
galaxy velocities. Note that this collapse does not account 
for the coherent infall of galaxies onto overdensities that will 
enhance and merge structure in the plane of t he sky (see e.g, 
iKaised 119871 ; iPraton. Melott fc McrCeelll997h . We then ap- 
ply a friends-of-friends algorithm with a spherical linking 
volume to the set of remaining galaxies and group centres. 
The radius of this linking sphere is chosen to be b times the 
mean inter galactic separation at that redshift, as defined in 
Eqn. 2.7 of lEke et ail l|2004al) . Small linking lengths would 
lead to many small systems, whereas very large ones would 
lead to percolation, and a single large connected compo- 
nent encompassing everything in the survey. An intermedi- 
ate value for b will lead to a more useful description of the 
structures present in the survey. This two-pass procedure 
provides a new and simple way to define connected struc- 
tures in galaxy redshift surveys. 

Fig. [T] shows how the number of galaxies in connected 
structures and the number of galaxies in the largest system 
vary with b for both the NGP and SGP. Both the NGP and 
SGP regions show a rapid growth of their largest system as 
b increases beyond ~ 0.8. Fig. [2] shows the variation of the 
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Figure 2. Variation in the number of structures extracted by 
the algorithm as the relative linking length, b, varies. The red 
line represents the first derivative of this function, corresponding 
to the rate of change of system number. We adopt a relative 
linking length, b = 0.69, close to this minimum, as denoted by the 
dot-dashed blue line. This corresponds to systems approximately 
bounded by a surface with a galaxy number overdensity of ~ 1.5. 



total number of connected structures and its first derivative. 
We pick 6 = 0.69 as a value that gives rise to an interest- 
ingly large range of system sizes. This corresponds to finding 
structures bounded by an irregular surface that has an over- 
density of Ap/p = 3/(2nb 3 ) « 1.5 (|Cole fc Lacevlll996l ). 

This choice is close to the point at which N/b = 0, 
where the growth in the number of systems arising from 
single galaxies becoming linked matches the decrease caused 
by merging the structures together. The resulting systems 
are shown in Fig. |3 

The abundance and extent of survey-sized connected 
structures will depend upon the geometry of the survey to 
which this algorithm is applied. Thus, while this technique 
is appropriate for comparing an observed data set with a 
mock catalogue of that particular survey, care is required 
when trying to infer the physical properties and abundance 
of the largest structures in the underlying distribution. 



2.3 Connected structure luminosities 

We would like to quantify the sizes of the objects found using 
this method in a way that (a) does not depend explicitly on 
the magnitude limit of the survey and (b) assigns the same 
size to a particular structure, independently of the redshift 
at which it is placed. Thus, rather than merely counting 
the number of galaxies present in each system, we define a 
luminosity that takes into account the flux limits of the sur- 
vey. The angular variation of the flux limit in the 2dFGRS is 
such that it changes over the length of the elongated filamen- 
tary structures. Consequently, it is necessary to convert the 



observed luminosity of each galaxy to the total luminosity 
that would have been seen without any flux limits, rather 
than correcting the observed luminosity of the system as 
a whole. This is done assuming that the galaxy luminosity 
fu nction &(L) is g i ven b y the Schechter function determined 
bv lNorberg et al.l (2002). i.< 



and using 

Lobs 



f~L$(L)dL 
L$(L)dL' 



(L„a) = (10 10 A _ ' ! L S ,-1.21) 



(2) 



where the luminosity limits in the integral in the denom- 
inator reflect the upper and lower flux limits evaluated at 
the redshift of the galaxy. L COI and Lobs represent the cor- 
rected and observed galaxy luminosities respectively, taking 
into account the k + e correction in a manner similar to 
iNorberg et ail l|2002h : 



k + e = 



z + 6z 2 
1 + 8.9z 2 - 5 ' 



(3) 



The total luminosity is calculated by summing up all the cor- 
rected galaxy luminosities for galaxies within that system, 
taking into account the weighting factors that describe the 
local incompleteness of the survey. Given the flux limit of the 
2dFGRS, the fraction of the total luminosity that is observed 
drops beneath a half at redshifts exceeding z ~ 0.12. For 
this reason, we will restrict our analysis to structures with 
Z ^ 0.12. Fig. U shows the systems found within the 2dF- 
GRS colour coded according to their luminosity. Comparing 
with Fig. O it is apparent how the luminosity picks out 
structures at larger distances than the membership, which 
includes only galaxies within the flux limits. 



2.4 Mock catalogues 

In order to address how well the observed distribution of 
system luminosities compares with that predicted for a 
2dFGRS-like survey of a ACDM cosmological model, we 
need to create mock galaxy surveys. This is done using a 
combination of the BASI CC d ark matter simulation de- 
J 20081). and the semi-analytical 



scribed by lAngulo et al 



model o f lBaugh et al 1 (l2005hT which is a development of that 



introduced bv lCole et ail l|2000h . 

In brief, the BASICC simulation contains 1448 3 parti- 
cles in a 1340/i _1 Mpc-long cube of a ACDM model with 
n M = 0.25, Qa = .75, ft b = 0.045, h = 0.73 and a s = 0.9. 
lAngulo et al.l l|2008f ) used a friends-of-friends (FOF) algo- 
rithm with a linking length of 0.2 times the mean inter- 
particle separation to define haloes. Requiring 10 particles 
to resolve a halo implies that the minimum resolvable halo 
mass is 5.5 x W 11 h~ 1 M e . All of these haloes were popu- 
lat ed with galaxies ac cording to the semi-analytical model 
of iBaugh et al.l (|2005l ) . Galaxies that reside in unresolved 
haloes are randomly placed onto dark matter particles out- 
side resolv e d hal oes, according to the method described by 
ICole et all (|2005T ). 

While the galaxy luminosity function produced by this 
model is close (within 0. 3 magnitudes a r ound L*) to that 
observed in the 2dFGRS INorberg et"afl (|2002h . we never- 
theless apply a luminosity-dependent shift in luminosities 
so that the cube of model galaxies has exactly the same 
luminosity function as the 2dFGRS. Only galaxies with 
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Figure 3. Spatial distribution of 2dFGRS galaxies in connected structures for systems with average redshift z ^ 0.12 in the RA-2 plane. 
These systems contain at least two galaxies and dot colours represent the weighted number of galaxies in the structure, where this weight 
takes into account the local angular incompleteness. 



L > 1.4 x 10 8 }i~ 2 ~Lq are included in the model cube. For 
the flux limit of the 2dFGRS, this implies that the mock 
galaxy catalogues will be missing low luminosity galaxies at 
z <C 0.025. This corresponds to just under one per cent of 
the total volume being considered. 

Fifty mocks of the 2dFGRS were created from this 
cube of model galaxies as follows: 

a) a random observer location and direction were chosen, 

b) volume-limited NGP and SGP surveys were created 
using periodic replicas of the cube if required. Note that 
the depth of the 2dFGRS is less than half the length of the 



BASICC simulation cube, and its effective volume is less 
than 10 -3 of the volume of the BASICC simulation, 

c) galaxies were removed according to the position- 
dependent 2dFGRS flux limits and completeness masks, 

d) the remaining galaxies were assigned a redshift according 
to their peculiar velocity in the simulation. 

Previous studies have shown that this semi-analytic 
model tends to place too many low luminosity galaxies into 
galaxy clusters (jEke et"aLll2004bl ; iGilbank fc Baloghl 120081 ; 
iKim et al.lfioogh . Given that the global luminosity function 
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Figure 4. Spatial distribution of 2dFGRS galaxies in connected structures in the RA-z plane. Colours represent the total system 
luminosity in units of log 10 (L/h~ 2 Lq). 



has been forced to match that in the observations, this im- 
plies that the model will lack low luminosity galaxies in lower 
density regions. This known problem will affect the structure 
finder. 

To try and reduce the impact of this known difference 
between the model and observations, we allow ourselves the 
freedom to jettison a smaller fraction of galaxies from the 
model groups than given by Eqn. [T] for the real 2dFGRS. 
This decreases, in the vicinity of the groups, the number 
density of points used for the structure-finding sweep of the 
friends-of-friends algorithm to an amount similar to that in 



the real survey. We achieve this in the model by multiplying 
f(z), as given by Eqn. [U by a constant, x- X > 1 implies 
that a higher fraction of galaxies are retained in the groups. 

In order to determine an appropriate value for x, we 
have measured the distribution of system orientations de- 
fined as 

* = tan_1 (^)' W 

where Al z represents the range of the member galaxies in 
the redshift direction and Al^, is the larger of the ranges 
of member galaxies in the RA and dec directions. Thus, 
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8 — tv/2 for a radial object and for one lying perpendicular 
to this. We use the greater of Al z and AZ^ to describe the 
scale size of the connected structure. 

Fig. [S] shows the cumulative probability distributions 
of system orientations for structures containing at least 20 
galaxies in the 2dFGRS and those recovered from 10 mock 
surveys using three different values of xQ. It is apparent that, 
when treated in the same way as the real data (i.e. \ = 1), 
the model contains too many objects aligned along the line 
of sight. This is a result of too many low luminosity galax- 
ies being placed into the redshift space volumes occupied by 
the model groups. When the 'interloping' galaxies are jetti- 
soned from the groups found in the mock catalogues, enough 
of these extra low luminosity galaxies are placed along radial 
lines that they bias the orientation distribution. Increasing 
X retains a higher fraction of the initially grouped galaxies 
in the groups, reducing the number of 'interlopers' returned 
to the field, and decreasing the number of radially aligned 
objects found in the second pass of the friends-of-friends al- 
gorithm. A value of \ ~ 1-15 produces a mock orientation 
distribution that is, according to a K-S test, indistinguish- 
able from that found in the 2dFGRS. This is chosen as the 
default value for these 2BASICC mocks throughout this pa- 
per. 

An additional set of 22 mock 2dFGRS sur veys created 
from the Hubble Volum e by ICole et al.l (|2000h . as used by 
iGaztanaga et all l|2005h . are also analysed to give some idea 
of the systematic differences resulting from different simu- 
lations and implementations of the galaxy formation mod- 
elling. For these mock catalogues, a value of x — 1-11 was 
required in order to recover the 2dFGRS system orientation 
distribution. 



3 RESULTS 

In this section, we will first describe the main features and 
properties of the connected structures found in the 2dFGRS 
and then compare with the results from the ACDM mock 
surveys. This comparison will encompass both the whole 
population of connected systems as well as the largest ob- 
jects. 



3.1 Connected systems in the 2dFGRS 

Projections onto the right ascension-redshift plane of all of 
the connected systems found in the 2dFGRS are shown in 
Figs[3]and|4] A total of 95, 010 galaxies are linked into 7, 603 
systems containing at least two members and mean redshifts 
no greater than 0.12. Of these, 3, 018 contain only two mem- 
bers. Almost 87 per cent of galaxies at z ^ 0.12 are placed 
into a connected structure. One large filamentary-structure 
stands out in each of the NGP and SGP wedges. These 
systems trace out t he same overden sities apparent in the 
2PIGG distribution (jEke et al.ll2004bh . the smoothed galaxy 



1 One might imagine that randomly oriented connected struc- 
tures would be uniformly distributed with 9. However, since sys- 
tems often contain more than two galaxies, which are generally 
not colinear, the definition of 9 leads to connected structures pref- 
erentially avoiding values towards the ends of the range [0,7r/2], 
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Figure 5. Cumulative probability distributions of system orien- 
tations for all objects containing at least 20 galaxies. Results are 
shown for the 2dFGRS and for averages of 10 2BASICC mocks. 
The mock distributions are derived from three different choices 
of x = 1-0, 1.15 and 1.30, as indicated in the legend. 
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Figure 6. The relation between object luminosity, L, and N ga i, 
the weighted number of galaxies in objects with z ^ 0.12. 



density m ap llBaugh et ah 2004h and the reconstructed den- 
sity field (jErdogdu et alj|2004f ) of the 2dFGRS. The largest 
NGP object, at z ~ 0.08, corresponds t o the large RA en d 
of the 'Sloan Great Wall' highlighted bv lGott et all |2005l V 
At a total bj-band luminosity of ~ 7.8 x 10 13 h~ 2 L , this 
is about 20 per cent more luminous than the largest equiv- 
alent in the SGP, which lies at z ~ 0.11 and RA ~ 10°. 
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Table 1. Properties of connected structures with z SC 0.12 identified in the 2dFGRS and mock surveys. Mock values are the mean 
over all 50 2BASICC and 22 Hubble Volume mock surveys with the uncertainties being the standard deviation of individual surveys 
from these mean values. N is the total number of connected systems within the catalogue, / the fraction of galaxies out to z = 0.12 in 
systems and iV ga j is the total number of galaxies out to z = 0.12. The fifth and sixth columns describe the average and maximum object 
luminosities (in units \og 1Q {L/h~ 2 Lq)). We give the comoving scale l m ax of the largest structure identified in the survey in the final 
column. This is defined as the largest in extent of galaxy members in the redshift, RA or dec directions. 

Survey N / log 10 (JV ga i) log 10 L log 10 L max lmax(h~ 1 Mpc) 

2dFGRS 7603 87.7% 5.06 11.16 13.89 198 

2BASICC 8023 ±250 85.9 ± 0.8% 5.07 ±0.04 11.08 ±0.03 13.44^;^ 81 ± 19 

HV 8253 ± 135 82.0 ± 0.8% 5.11 ±0.02 11.10 ±0.03 13.55^0 20 93 ± 27 
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Figure 7. The distribution of system luminosities with increasing 
redshift. 



The extents in RA of these largest NGP and SGP systems 
in comoving coordinates are ~ 198/i _1 Mpc and 99/i _1 Mpc 
respectively. While the NGP systems contains twice as many 
members as that in the SGP, it is very nearly broken into 
two pieces around RA ~ 185°, where the galaxy density 
drops off considerably. 

More locally, a continuation to lowe r declinations of the 
CfA Great Wall l|Geller fc Huchralll989f ) is seen at z ~ 0.02 
in the NGP, although our algorithm breaks this up into a 
few different components. 

Some average and extreme properties of the systems 
identified in the 2dFGRS are listed in Table [T] In more de- 
tail, the correlation between the luminosity and weighted 
(to account for the local angular incompleteness of the sur- 
vey) membership of connected structures is shown in Fig.© 
The second largest system contains at least twice as many 
members as the third largest one, and almost 3 times as 
much luminosity, making the largest NGP and SGP struc- 
tures stand out from the remaining systems. The scatter 
around the mean relation reflects the range of redshifts in 
the flux-limited survey. While the object luminosity is cor- 
rected to take account of this flux limit, the weighted number 
of galaxies is not. 



Figure 8. The weighted number of galaxies in the largest struc- 
ture for NGP (black) and SGP (red) systems with z < 0.12, in- 
cluding any members with redshifts greater than this limit subject 
to the mean system redshift remaining below it. In both cases the 
dashed lines show the 2dFGRS data, whilst solid lines represent 
the mean number of galaxies in the largest object across 50 mock 
surveys. In the NGP case, we include also error bars representing 
the standard deviation of these surveys around the mean. 



Fig. [7] shows how connected system luminosity varies 
with redshift, with the lower envelope representing the to- 
tal luminosity of 2 galaxies at the flux limit. The geometry 
of the survey precludes finding very luminous structures at 
low redshift because of the small volume sampled, but the 
greater volume available at larger redshifts is sufficient to 
contain larger, more luminous filamentary structures. The 
largest NGP and SGP systems are once again conspicuous 
at the top of the figure. 



3.2 Comparison with mock catalogues 

We follow almost the same procedure to define connected 
structures in the 50 2BASICC mocks, with the only dif- 
ference being that the fraction of galaxies retained in the 
groups, f(z) in Eqn.[T] is increased by a factor \ — 1-15, as 
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Figure 9. The redshift distribution of all objects with at least two 
members. The blue line represents the mean number of connected 
structures as a function of redshift across 50 mock surveys, with 
error bars representing their standard deviation around the mean. 
The black line corresponds to systems detected in the 2dFGRS. 



described in Sj2] The impact of this choice on the results is 
discussed later. 

Fig. [5] shows how the number of galaxies in the most 
populated system grows as b is increased for each survey re- 
gion. While the behaviour is broadly similar to that of the 
largest filament in the 2dFGRS, the onset of percolation in 
mock catalogues is delayed by about 0.1 times the mean 
intergalaxy separation. As a consequence, at b = 0.69, the 
largest 2dFGRS system (located in the NGP) is a signifi- 
cant outlier, being more populated than the corresponding 
structure in all but one of the 50 mock catalogues. The mock 
system containing more galaxies than the largest one in the 
2dFGRS is placed at z ~ 0.01 and is much less luminous. It 
is the result of the randomly chosen observer being put very 
near to a large galaxy cluster. 

The larger value of x adopted for mock surveys means 
that the number of galaxies and group centres used for the 
algorithm is on average ~ 13 per cent lower than in the 
case of the 2dFGRS. If we were to use a value of b that 
was correspondingly larger (i.e. scaled by the inverse cube 
root of the number of points to be 0.72) then this does not 
significantly affect the discrepancy between the number of 
galaxies in the most populated systems in the mock or real 
2dFGRS. 

The redshift distribution of the structures is shown in 
Fig. [9] At z < 0.025, where the mock catalogues are known 
to be missing low luminosity galaxies, the mock surveys con- 
tain fewer objects than the real 2dFGRS. The main reason 
for this is actually not the incompleteness in the mocks, but 
the fact that too many low luminosity galaxies are placed 
into large groups, reducing the number available to form 
other small systems. This local volume represents only a 
small fraction of the survey. 



For redshifts greater than 0.04, the number of real 2dF- 
GRS structures is typically slightly below the mean of the 
50 mock surveys. This is reflected in the first column of Ta- 
ble [TJ which shows that the total number of systems in the 
2dFGRS is (1 — 2)cr beneath that of the mocks, despite the 
fact that a slightly higher fraction of galaxies are placed into 
the 2dFGRS structures. The total number of galaxies in the 
2dFGRS and mock surveys matches well by construction, 
but the excess of mock galaxies placed into groups means 
that fewer are available in lower density regions for linking 
together small systems. The relatively high fraction of galax- 
ies placed into structures and low number of structures in 
the 2dFGRS leads to a larger mean luminosity. This differ- 
ence can be removed by not including the two most luminous 
systems in the 2dFGRS in the calculation. 

The distribution of system luminosities is shown in 
Fig. 1101 The mocks have a relative lack of structures be- 
neath L ~ 10 9 ft" 2 L,0, more than the real 2dFGRS at 
L ~ 3 x 10 10 h~ 2 Lq, which is the peak of the distribution 
and corresponds approximately to two L* galaxies, and a 
paucity of filamentary systems like the largest ones in the 
2dFGRS. As stated above, the difference between the model 
and real distributions at low luminosities arises mostly be- 
cause the lowest luminosity galaxies in the model are more 
likely to be placed into larger groups and hence are not 
available to form very low luminosity systems. The deficit 
of lower luminosity galaxies outside groups impacts in two 
ways upon the most luminous model structures. They tend 
to gain luminosity because their groups are slightly more 
luminous than those of corresponding mass in the 2dFGRS. 
However, the lack of low luminosity galaxies in the lower 
density regions, makes it less likely that large structures will 
join together. It is this second effect that is more important, 
resulting in none of the 50 mock surveys yielding two sys- 
tems as luminous as the two most luminous in the 2dFGRS. 

Given that many differences between the real and mock 
structure luminosity distributions result from the different 
spatial distributions of low luminosity galaxies in the real 
and mock surveys, and that we have used a different value 
of x f° r rea -l an d mock surveys, one might reasonably ask 
what changes when x = 1 is used for the mocks. This is 
shown in Fig. 1111 where three different x values are used 
for the 2BASICC mocks. Increasing x retains more galaxies 
in the groups, leaving fewer galaxies to help the algorithm 
link together larger structures. This leads to a decrease in 
the luminosity of the most luminous systems. Decreasing 
X leads to an increase in the luminosity of the most lumi- 
nous systems, but even for x ~ 1 there are still no surveys 
with two structures at least as luminous as the second most 
luminous 2dFGRS system. Nevertheless, we do obtain two 
surveys with a connected system more luminous than the 
brightest 2dFGRS system. However, as shown in Fig. [5] this 
comes at the expense of producing a set of objects that are 
significantly more radially oriented than those found in the 
2dFGRS. 

Also shown in Fig. QT] is the distribution of sys- 
tems found in th e 22 Hubble Volume mock catalogues of 
ICole et al. I <|2000h . After tuning x to be 1.11 to recover the 
2dFGRS structure orientation distribution, the Hubble Vol- 
ume mocks are broadly similar to the 2BASICC ones, with 
the abundance of the most luminous systems being almost 
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Figure 10. The distribution of luminosities for all structures with 
a minimum membership of two, out to a rcdshift z = 0.12. The 
red lines show the distribution for each of the 50 2BASICC mock 
catalogues. The black line shows the distribution for systems in 
the 2dFGRS. 
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Figure 11. Structure luminosity distributions for different values 
of x in the 2BASICC surveys, for the Hubble Volume catalogues 
and the 2dFGRS. Mock survey distributions have been averaged 
over the 50 surveys in the 2BASICC and the 22 in the Hubble 
Volume simulations. 



unchanged. The bottom row of table[T]contains statistics for 
the systems found in these Hubble Volume mocks. 



4 CONCLUSIONS 

We have described a simple algorithm with which to define 
connected structure within galaxy redshift surveys, and ap- 
plied it to the 2dFGRS. This algorithm explicitly addresses 
the redshift-space distortion associated with rapidly moving 
galaxies within groups and clusters. The 7, 603 2dFGRS con- 
nected structures at z ^ 0.12 containing at least two mem- 
bers range up to ~ 200/i _1 Mpc in extent, but are mostly 
associations of two L* galaxies. Quantifying object sizes via 
their total luminosities, corrected for the survey flux limits, 
we find that the largest systems are filamentary in nature 
and have 6j luminosities of almost 10 14 /i _2 Lq. 

Applying the same algorithm to ACDM mock 2dFGRS 
catalogues, constructed using large-volume dark matter 
simula tions and the semi-analytical model of iBaugh et alj 
(2005), we find a broadly similar distribution of structures 
to those in the real data. There are, however, a few differ- 
ences in detail. Many of these result from the fact that the 
model places too many L <J L* galaxies into groups and 
clusters compared with the 2dFGRS. This biases the ori- 
entation distribution of the systems containing at least 20 
galaxies to contain more radially aligned objects in the mock 
survey than in the 2dFGRS. Applying a crude correction to 
the algorithm to enable it to recover the same orientation 
distribution in the mock survey as it does in the 2dFGRS 
leads to the largest mock structures being significantly less 
luminous than those in the 2dFGRS. 

It is clear that at least some of the differences between 
the properties of the structures in the 2dFGRS and the 
mock catalogues arise from inadequacies in the galaxy for- 
mation model that was used to construct the mocks. We 
have attempted to overcome these inadequacies as far as 
possible through empirical corrections. Our analysis indi- 
cates that the largest filamentary structures seen in the 2dF- 
GRS are not reproduced in the mock catalogues. However, 
while this discrepancy could signal a failure of the standard 
ACDM cosmological model on large scales, it seems more 
plausible that it reflects a shortcoming in the predictions 
of our models of galaxy formation for the abundance and 
spatial distribution of galaxies on small scales. 
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